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- BRIEF DESCRIPTION OF THE DRAWINGS 
This invention will be more fully described with reference to the drawings in 

which: 

Figure 1 depicts the universal code equivalent of the mitochondrial l-Scel gene 
(SEQIDNOS:3AND4) . 

Figure 2 depicts the nucleotide sequence of the invention encoding the enzyme 
l-Scel and the amino acid sequence of the natural l-Scel enzyme (SEQ ID NOS: 5 and 
2). " * 

Figure 3 depicts the l-Scel recognition sequence and indicates possible base 
mutations in the recognition site and the effect of such mutations on stringency of 
recognition (SEQ ID NOS: 6, 7, AND 8) . 

Figure 4 A-B is the nucleotide sequence and deduced amino acid sequence of 
a region of plasmid pSCM525. The nucleotide sequence of the invention encoding the 
enzyme l-Scel is enclosed in the box (SEQ ID NOS: 9 through 16) . 

Figure 5 depicts variations around the amino acid sequence of the enzyme I- 
Scel (SEQ. ID NO:2) . 

Figure 6 shows Group I intron encoding endonucleases and related 
endonucleases (SEQ ID NOS: 17 through 44) . 

Figure 7 depicts yeast expression vectors containing the synthetic gene for I- 

Scel. 

Figure 8 depicts the mammalian expression vector PRSV l-Scel. 

Figure 9 is a restriction map of the plasmid pAFIOO. (See also YEAST, 6:521- 
534, 1990, which is relied upon and incorporated by reference herein). 

Figures 10A and 10B show the nucleotide sequence and restriction sites of 
regions of the plasmid pAFIOO (SEQ ID NOS:45 through 50) . -- 

Figure 11 depicts an insertion vector pTSMoo, pTKMu), and pTTu) containing 
the l-Scel site for E. coli and other bacteria. 

Figure 12 depicts an insertion vector pTYW6 containing the l-Scel site for 

yeast. 
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Figure 13A-C depicts an insertion vector PMLV LTR SAPLZ containing the I- 
Scel site for mammalian cells. 

Figure 14 A-B depicts a set of seven transgenic yeast strains cleaved by l-Scel. 
Chromosomes from FY1679 (control) and from seven transgenic yeast strains with I- 
Scel sites inserted at various positions along chromosome XI were treated with l-Scel. 
DNA was electrophoresed on 1 % agarose (SeaKem) gel in 0.25 X TBE buffer at 130 V 
and 12°C on a Rotaphor apparatus (Biometra) for 70 hrs using 100 sec to 40 sec 
decreasing pulse times. (A) DNA was stained with ethidium bromide (0.2 ug/ml) and 
transferred to a Hybond N (Amersham) membrane for hybridization. (B) 32 P labelled 
cosmid pUKG040 which hybridizes with the shortest fragment of the set was used as a 
probe. Positions of chromosome XI and shorter chromosomes are indicated. 

Figure 15A-E depicts the rationale of the nested chromosomal fragmentation 
strategy for genetic mapping. (A) Positions of l-Sce\ sites are placed on the map, 
irrespective of the left/right orientation (shorter fragments are arbitrarily placed on the 
left). Fragment sizes as measured from PFGE (Fig. 14A) are indicated in kb (note that 
the sum of the two fragment sizes varies slightly due to the limit of precision of each 
measurement). (B) Hybridization with the probe that hybridizes the shortest fragment of 
the set determines the orientation of each fragment (see Fig. 14B). Fragments that 
hybridize with the probe (full lines) have been placed arbitrarily to the left. (C) 
Transgenic yeast strains have been ordered with increasing sizes of hybridizing 
chromosome fragments. (D) Deduced l-Sce\ map with minimal and maximal size of 
intervals indicated in kb (variations in some intervals are due to limitations of PFGE 
measurements). (E) Chromosome subfragments are used as probes to assign each 
cosmid clone to a given map interval or across a given /-Seel site. 

Figure 16A-C depicts mapping of the /-Seel sites of transgenic yeast strains by 
hybridization with left end and right end probes of chromosome XI. Chromosomes from 
FY1679 (control) and the seven transgenic yeast strains were digested with /-Seel. 
Transgenic strains were placed in order as explained in Fig. 15. Electrophoresis 
conditions were as in Fig. 14. 32 P labelled cosmids pUKG040 and pUKG066 were used 
as left end and right end probes, respectively. 

Figure 17A-H depicts mapping of a cosmid collection using the nested 
chromosomal fragments as probes. Cosmid DNAs were digested with Eco RI and 
electrophoresed on 0.9% agarose (SeaKem) gel at 1 .5 V/cm for 14 hrs, stained with 
ethidium bromide and transferred to a Hybond N membrane. Cosmids were placed in 
order from previous hybridizations to help visualize the strategy. Hybridizations were 
carried out serially on three identical membranes using left end nested chromosome 
fragments purified on PFGE (see Fig. 16) as probes. A: ethidium bromide staining 
(ladder is the BRL "1 kb ladder"), B: membrane #1 , probe: Left tel to A302 site, 
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C: membrane #1, probe: Left tel to M57 site, D: membrane #2, probe: Left tel to H81 
site, E: membrane #2, probe: Left tel to T62 site, F: membrane #3, probe: Left tel to G41 
site, G: membrane #3, probe: Left tel to D304 site, H: membrane #3, probe: entire 
chromosome XI. 

Figure 18 depicts a map of the yeast chromosome XI as determined from the 
nested chromosomal fragmentation strategy. The chromosome is divided into eight 
intervals (with sizes indicated in kb, see Fig. 15D) separated by seven /-Seel sites (E40, 
A302 ...). Cosmid clones falling either within intervals or across a given l-Sce\ site are 
listed below intervals or below interval boundaries, respectively. Cosmid clones that 
hybridize with selected genes used as probes are indicated by letters (a-i). They 
localize the gene with respect to the /-Seel map and allow comparison with the genetic 
map (top). 

Figure 19 A-B depicts diagrams of successful site directed homologous 
recombination experiments performed in yeast. 

Figure 20A-E. Experimental design for the detection of HR homologous 
recombination (HR) induced by l-Sce I. a) Maps of the 7.5 kb tk-PhleoLacZ retrovirus 
(G-MtkPL) and of the 6.0 kb PhleoLacZ retrovirus (G-MPL), SA is splice acceptor site. 
G-MtkPL sequences (from G-MtkPL virus) contains PhleoLacZ fusion gene for positive 
selection of infected cells (in phleomycin-containing medium) and tk gene for negative 
selection (in gancyclovir-containing medium). G-MPL sequences (from G-MPL virus) 
contains only PhleoLacZ sequences, b) Maps of proviral structures following retroviral 
integration of G-MtkPL and G-MPL. I-Sce I PhleoLacZ LTR duplicates, placing l-Sce I 
PhleoLacZ sequences in the 5' LTR. The virus vector (which functions as a promoter 
trap) is transcribed (arrow) by a flanking cellular promoter, P. c) l-Sce I creates two 
double strand breaks (DSBs) in host DNA liberating the central segment and leaving 
broken chromosome ends that can pair with the donor plasmid, pVRneo (d). 
e) Expected recombinant locus following HR. 

Figure 21A-B. A. Scheme of pG-MPL. SD and SA are splice donor and splice 
acceptor sites. The structure of the unspliced 5.8 kb (genomic) and spliced 4;2 kb 
transcripts is shown below. Heavy bar is 32 P radiolabeled LacZ probe (P). B. RNA 
Northern blot analysis of a pG MLP transformed ip-2 producer clone using 
polyadenylated RNA. Note that the genomic and the spliced mRNA are produced at the 
same high level. 

Figure 22A-B. A. Introduction of duplicated l-Sce I recognition sites into the 
genome of mammalian cells by retrovirus integration. Scheme of G-MPL and G-MtkPL 
proviruses which illustrates positions of the two LTRs and pertinent restriction sites. 
The size of Bel I fragments and of l-Sce I fragments are indicated. Heavy bar is 32 P 
radiolabeled LacZ probe (P). B. Southern blot analysis of cellular DNA from NIH3T3 

5 



Attorney Docket No. 03495.0111-14 



fibroblasts cells infected by G-MtkPL and PCC7-S multipotent cells infected by G-MPL. 
Bel I digests demonstrating LTR mediated PhleoLacZ duplication; l-Sce I digests 
demonstrating faithful duplication of l-Sce I sites. 

Figure 23 A-B . Verification of recombination by Southern. A.: Expected 
fragment sizes in kilobase pairs (kb) of provirus at the recombinant locus. 1) the 
parental proviral locus. Heavy bar (P) is 32 P radioactively labelled probe used for 
hybridization. 2) a recombinant derived after cleavage at the two l-Sce I sites followed 
by gap repair using pVR neo (double-site homologous recombination, DsHR). 3) a 
recombination event initiated by the cleavage at the l-Sce I sites in the left LTR (single- 
site homologous recombination, SsHR). B.: Southern analysis of DNA from NIH3T3/G- 
MtkPL clones 1 and 2, PCC7-S/G-MPL clones 3 and 4 and transformants derived from 
cotransfection with pCMV(l-Sce l+) and pVRneo (1a, 1b, 2a, 3a, 3b and 4a). Kpn I 
digestion of the parental DNA generates a 4.2 kb fragment containing LacZ fragment. 
Recombinants 1a and 3a are examples of DsHR Recombinants 1b, 2a, 3b and 4a are 
examples of SsHR. 

Figure 24 A-B . Verification of recombination by Northern blot analyses. A.: 
Expected structure and sizes (in kb) of RNA from PCC7-S/G-MPL clone 3 cells before 
(top) and after (bottom) l-Sce I induced HR with pVRneo.l Heavy bars P1 and P2 are 
32 P radioactively labelled probes. B.: Northern blot analysis of the PCC7-S/G-MPL 
clone 3 recombinant (total RNA). Lane 3 is parental cells, lane 3a recombinant cells. 
Two first lanes were probed with LacZ P1 , two last lanes are probed with neo P2. 
parental PCC7-S/G-MPL clone 3 cells express a 7.0 kb LacZ RNA as expected of 
trapping of a cellular promoter leading to expression of a cellular-viral fusion RNA. The 
recombinant clone does not express this Lacz RNA but expresses a neo RNA of 5.0 kb, 
corresponding to the size expected for an accurate replacement of PhleoLacZ by neo 
gene. 

Figure 25A-C. Types of recombination events induced by l-Sce I DSBs, a) 
Schematic drawing of the structure of the recombination substrate. The G-MtkPL has 
provirus two LTRs, each containing an l-Sce I recognition site and a PhleoLacZ gene. 
The LTRs are separated by viral sequences containing the tk gene. The phenotype of 
G-MtkPL containing cells is Phleo R , Gls s , (3-Gal±. b) Possible modes of 
intra-chromosomal recombination. 1) The l-Sce 1 endonuclease cuts the l-Sce I site in 
the 5'LTR. The 5' part of U3 of the 5'LTR can pair and recombine with it homologous 
sequence in the 3'LTR (by single-strand annealing ( SSA). 2) The l-Sce I endonuclease 
cuts the l-Sce I site in the 3'LTR. The 3' part of U3 of the 3'LTR can pair and recombine 
with its homologous sequence in the 5'LTR (by SSA). 3) The l-Sce I endonuclease cuts 
l-Sce I sites in the two LTRs. The two free ends can relegate (by an end-joining 
mechanism). The resulting recombination product in each of the three models is a 
solitary LTR (see right side). No modification would occur in the cellular sequences 
flanking the integration site, c) The l-Sce I endonuclease cuts the l-Sce I sites in the 
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two LTRs. The two free ends can be repaired (by a gap repair mechanism) using the 
homologous chromosome. On the right, the resulting recombination product is the 
deletion of the proviral integration locus. 

Figure 26A-C. Southern blot analysis of DNA from NIH3T3/G-MtkPL 1 and 2, 
and PhleoLacZ recombinants derived from transfections with pCMV(l-Sce l+) selected 
in Gancyclovir containing medium, a) Expected fragment sizes in kilobase pair (kbp) of 
parental provirus after digestion with Pst I endonuclease. Pst I digestion of the parental 
DNA NH3T3/G-MtkPL 1 generates two fragments of 10 kbp and of the parental 
NIH3T3/G-MtkPL 2 two fragments of 7 kbp and 9 kbp. b) Southern blot analysis of DNA 
digested by Pst I from NIH3T3/G-MtkPL 1 , and recombinants derived from transfection 
with pCMV(l-Sce l+) (1 .1 to 1 .5). c) Southern blot analysis of DNA digested by Pst I 
from NIH3T3/G-MtkPL 2, and recombinants derived from transfection with pCMV(l-Sce 
l+) (2.1 to 2.6). Heavy bar is 32 P radiolabeled LacZ probe (P). 

Figure 27A-C. Southern blot analysis of DNA from NIH3T3/G-MtkPL 1 and 2, 
and PhleoLacZ* recombinants derived from transfections with pCMV(l-Sce l+) and 
pCMV(l-Sce I-) and selection in Phleomycin and Gancyclovir containing medium, 
alj Expected fragment sizes in kbp of parental provirus after digestion with Pst I or Bel I 
endonuclease. Pst I digestion of the parental DNA NIH3T3/G-MtkPL 1 generates two 
fragments of 10 kbp. Bel I digestion of the parental DNA NIH3T3/G-MtkPL 2 generates 
three fragments of 9.2 kbp, 7.2 kbp and 6.0 kbp. a2) Expected fragment sizes in kbp of 
recombinants after digestion with Pst I or Bel I endonuclease. Pst I digestion of DNA of 
the recombinant derived from NIH3T3/G-MtkPL 1 generates one fragment of 13.6 kbp. 
Bel I digestion of the DNA of the recombinants derived from NIH3T3/G-MtkPL 2 
generates two fragments of 9.2 kbp and 6.0 kbp. b) Southern blot analysis of DNA from 
NIH3T3/G-MtkPL 1, and recombinants derived from transfection with pCMV(l-Sce I-) 
and pCMV(l-Sce l+) (1c, 1d). c) Southern analysis of DNA from NIH3T3/G-MtkPL 2, 
and transformants derived from transfection with pCMV(l-Sce I-) (2a, 2b) and pCMV(l- 
Sce l+) (2c to 2h). Heavy bar is 32 P radiolabeled LacZ probe (P). 

Figure 28. Figure 28 is a diagram illustrating the loss of heterozygosity by the 
insertion or presence of an l-Sce I site, expression of the enzyme l-Sce I, cleavage at 
the site, and repair of the double strand break at the site with the corresponding 
chromatid. 

Figure 29. Figure 29 is a diagram illustrating conditional activation of a gene. 
An l-Sce I site is integrated between tandem repeats, and the enzyme l-Sce I is 
expressed. The enzyme cleaves the double stranded DNA at the l-Sce I site. The 
double strand break is repaired by single stand annealing, yielding an active gene. 

Figure 30. Figure 30 is a diagram illustrating one step rearrangement of a 
gene by integration of an l-Sce I site or by use of an l-Sce I site present in the gene. A 
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plasmid having either one l-Sce I site within an inactive gene, or two l-Sce I sites at 
either end of an active gene without a promoter, is introduced into the cell. The cell 
contains an inactive form of the corresponding gene. The enzyme l-Sce I cuts the 
plasmid at the l-Sce I sites, and recombination between the chromosome and the 
plasmid yields an active gene replacing the inactive gene. 

Figure 31. Figure 31 is a diagram illustrating the duplication of a locus. An I- 
Sce I site and a distal part of the locus are inserted into the gene by classical gene 
replacement. The l-Sce I site is cleaved by l-Sce I enzyme, and the break is repaired 
by homologous sequences. This results in duplication of the entire locus. 

Figure 32. Figure 30 32 is a diagram illustrating the deletion of a locus. Two I- 
Sce I sites are added to flank the locus to be deleted. The l-Sce I enzyme is expressed, 
and the sites are cleaved. The two remaining ends recombine, deleting the locus 
between the two l-Sce I sites. 

Figure 33. Figure 33 is a diagram of plasmid pG-MtkAPAPL showing the 
restriction sites. The plasmid is constructed by deletion of the polyadenylation region of 
the tk gene from the pGMtkPL plasmid. - 



On page 18, replace the last paragraph with the following amended paragraph: 

- The enzyme l-Scel has a known recognition site. (ref. 14.) The recognition 
site of l-Scel is a non-symmetrical sequence that extends over 18 bp as determined by 
systematic mutational analysis. The sequence reads: (arrows indicate cuts) 

I 

5' TAGGGATAA CAGGGTAAT 3' (SEQ ID NO:51) 
3' ATCCC TATTGTCCCATTA 5' (SEQ ID NO:52) 

t 

On pages 47-48, replace the paragraph bridging these pages with the following 

amended paragraph: 

— e- The supernatant of this clone was used to infect other mouse cells (1009) 
by spreading 10 5 virus particles on 10 5 cells in DMEM medium with 10% fetal calf serum 
and 5 mg/ml of "po l ybra i n" polvbrene (hexadimethrine bromide) . Medium was replaced 
6 hours after infection by the same fresh medium. -- 
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On pages 71-72, replace the paragraph bridging these pages with the following 
amended paragraph: 

- This example describes the use of the l-Sce I meganuclease (involved in 
intron homing of mitochondria of the yeast Saccharomyces cerevisiae) (6B, 28B) to 
induce DSB and mediate recombination in mammalian cells. I-Sce I is a very 
rare-cutting restriction endonuclease, with an 18 bp long recognition site (29B, 22B). In 
vivo, \-Sce I endonuclease can induce recombination in a modified yeast nucleus by 
initiating a specific DBS leading to gap repair by the cell (30B, 17B, 21 B). Therefore, 
this approach can potentially be used as a means of introducing specific DSB in 
chromosomal target DNA with a view to manipulate chromosomes in living cells. The 
I-Sce l-mediated recombination is superior to recombinase system [1 1 B ] for 
chromosome engineering since the latter requires the presence of target sites on both 
host and donor DNA molecules, leading to reaction that is reversible. - 

On pages 72-73, replace the paragraph bridging these pages with the following 

amended paragraph: 

- pG-MtkPL was obtained in five steps: (I) insertion of the 0.3 kbp Bgl ll-Sma I 
fragment (treated with Klenow enzyme) of the Moloney Murine Leukemia Virus 
(MoMuLV) envgene (25B) containing a splice acceptor (SA) between the Nhe I and 
Xoa I sites (treated with Klenow enzyme), in the U3 sequence of the 3'LTR of MoMuLV, 
in an intermediate plasmid. (II) Insertion in this modified LTR of a 3.5 kbp Nco \-Xho I 
fragment containing the PhleoLacZ fusion gene [13B ] (from pUT65; Cayla Laboratory, 
Zone Commerciale du Gros, Toulouse, France) at the Xba I site next to SA. (Ill) 
Insertion of this 3'LTR (containing SA and PhleoLacZ), recovered by Sal \-EcoR I 
double digestion in the p5'LTR plasmid (a plasmid containing the 5'LTR up to the 
nucleotide no 563 of MoMuLV [12B ]) between the Xho I and the EcoR I site. (IV) 
Insertion of a synthetic I-Sce I recognition site into the Nco I site in the 3'LTR (between 
SA and PhleoLacZ), and (V) insertion (antisense to the retroviral genome) of the 1.6 kbp 
tk gene with its promoter with linker adaptators at the Pst I site of pG-MPL. - 

On page 80, replace the paragraph beginning on line 5 with the following 

amended paragraph: 

- The generation of tk'IPhleoLacZ cells is probably a consequence of either a 
homo-allelic and/or an ectopic gene conversion event (36B). Isolation and detailed 
molecular analysis of the proviral integration sites will provide information on the relative 
frequency of each of these events for the resolution of chromosomal DSBs by the cell. 
This quantitative information is important as, in mammalian cells, the high redundancy 
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of genomic sequences raises the possibility of a repair of DSBs by ectopic homologous 
sequences. Ectopic recombination for repair of DSBs may be involved in genome 
shaping and diversity in evolution [29]. - 

After making the above amendments, please move pages 50-58 and insert these 
pages after page 88 and before page 89 of the application. 

Renumber pages 50-58 as pages 80-88, respectively. 

Renumber pages 59-88 as pages 50-79, respectively. 

On pages 89-93, replace the section entitled "References" with the following 
amended section. 
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FIGURE 1 



The Universal Code Equivalent of 
the Mitochondrial I -See I Gene. 
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1 and 2: These amino acids are absolutely necessary to produce 
catalytic activity. Other substitutions are possible, 
such as deletions of the 10 first amino acids. 



# « 



FIGURE 3 



5' 
3» 



T T GA 
A AATGCTTTCCA 




T 

C A AA 

t f ft 

GTTACG 



] 
J 



severely affected 



CA G J parLULl y affected 



P 



unaffected 
5» 



FIGURE . 4 



i ^TOItWfCtC AM C?c CTC A* ^ 

11 1 V t 5 A t 

CCt ATC OA AAA CSC CAC CAA CCC « CT W irr. *tt CCT ccc ~« ^ 

• t M X K | o q t 5 i r ? / T ^ « *?c :tc *c c p 

?* ?* r 1 v s" r ?* ? T ;* ? c r* ««™»^ 

^TeTCCCeoOQttTTCOCCUnUtrTA ATC CAC CTC XA CCA CAC GtT TCC CiA CTC AA ACC 

"ai*ARQVS.|| L || 



.a;-: 



1HI ? 



2020 

I 

2QM 
13 

91 8A 
2 

222$ 
2 

22 M 



COC M m 8COCAAC4CAATIA aj* UTTA<JC7CACTCA?TAfiKACCCCACGCTTTACACHT A?C 
£TT CCC Art CCT ATC TTC TCT MA AJT art ACC «A TAA UUTTTCACAOttUUCACC? ATC 
ACC AWJ J~ 1A1 AAT TCT CAT CTT TCA CAOCTTATCATCAATAACCTTTA AIT* U* fAC -TATCAC 



ACTTAAATTCCIAACCCACTCACOCACrfrPC* ATC AAA TCT AAC AAT OC 

■ I • ■ « a 



CTC ATC 5TC ATC CTC <iC«. 

t t v : t 4 



2J*4 

13 

iUI 

yj 

3«M 
33 

39*4 

73 

3»7 



ACC CTC ACC CTC CAT ACT CTA CCC AXA COC l-« «*T ATC COB CTA CTC CCC <M 
^V TL0AVC:CLV«9vt f C 



ctc rr« 



CCC CAT ATC « CTC ATC CCT CAA OR CAC CCA CO AM <TAT CAC UC TOT CTC CTC 
*° * R *****0CIB I1 nwcw i 

TTW a*-* TCC OCA TCC CM CAC AAC TTC ?GC ?CC GOT AAC CTC CTC AfiC COC OTC iat. -!T 

F ** A c«OHriisaiVLirAtr 

ACT CCC CAT UW CCT 6TT CAC AA? TAA fCA tCSOCT C S f AtA ATC TOT CCA ATT CTC AfiC CCA 

T r 8 F F v ^!puv ' ■ c c r v $ c 

TAA CAATTTC/ 



1423 
2433 

3*«j 

*>: 



A TW UAX ATC AAA AAC ATC AAA AAA AAC CAC CTA ATC 



2ro 

12 



34H1 

1 1 


AAC CTC OCT 
tf L C 


CCC AAC 
9 8 


TCT AAA 
• X 


CTC 
I 


CTC 

L 


AAA 

K 


&AA TAC 
B T 


AAA 
8 


TCC 
A 


CAC 

w 


CTC 


ATC 

I 


CAA 
f 


CTC 

I 


AAC 

N 


2 73: 
32 


2131 
33 


ATC SAA r%C 
T 1 Q 


++r. «AA 

r t 


CCA CCT 
A 0 


ATC 

X 


CCT 
C 


etc 

L 


ATC CTC 

I I 


• 


CAT 

D 


CCT 
A 


TAC 
T 


ATC 

: 


car 
* 


XT 
% 


OCT 
A 


2730 
52 


2m 

33 


CAT CAA CCT 

die 


AAA ACC 
I T 


fir +fs* 
X c 


ATC 
H 


CAC 

0 


TTC 

r 


CAC TOC 
C 8 


AAA 
E 


AAC 
8 


AAA 
X 


OCA 
A 


TAC 

T 


ATC 
M 


GAC 
0 


CAC 

n 


283U 
T3 


2 331 

1J 


CTA TOT CTC 

V C I 


CTC T*C 
L I 


AT CAC 
0 0 


rr« 

8 


V 


<-fC 

T. 


tec COC 
* • 


act 
p 


CAC 
1 


AAA 
8 


AAA 
8 


OAA 
8 


cor 

ti 


OTT 

V 


AAC 
8 


2 910 
92 


2911 
33 


CAC CTC OCT 

1 m C 


AAC CTC 
f L 


OA ATC 
V X 


ACC 
T 


rcc 

8 


CCC 
0 


CCC CAC 
A 0 


ACT 
? 


rrr 
f 


AAA 

X 


<ne 
i 


CAA 

c 


OCT 
A 


TTC 

r 


AAC 
8 


2f0 

112 


2m 

113 


AAA CTC OCT 
I L A 


AAC CTC 
1 b 


TTC ATC 
f I 


en AAC 
V 8 


AAC 
8 


AAA AAA 
I 1 


ACC 
T 


ATC 

I 


COC 
9 


AAC 
8 


AAT 
8 


rw: 

!. 


CTT 

V 


CAA 
F 


1030 

132 


3031 
133 


aac a* cm 
■ t h 


T f 


ATC TCT 

a 8 


CTC 
I 


A 


TAC 

t 


TCC TTC 

8 r 


ATC 
M 


CAT CAT CCT 

o o a 


OCT 
C 


AAA 
f 


TCC 
8 


CAT 
0 




3091 UC AM AAA 

un 1 b 


AAC TWX 
I A 


MX AAC 

X a 


AAA 
8 


TCfi 
8 


ATC 

I 


CTA CTC 


AAC 
8 


ACC 
T 


CM TCT 

o s 


TTC 
F 


ACT 
T 


TTC 
f 


CAA 
t 


3130 

172 


3111 

173 


y* v* r* 


TAC CTC 

T U 


«TT AAA 
V 8 


•4JT 

C 


CTC 

L 


Obi 
8 


AAA* AAA 


nc 

r 


CAA CTC AAC 
Q I 8 


TCT 

C 


TAC 

r 


CTA 

V 


AAA 
8 


3213 
132 


mi 
in 


ATC AAC AAA 
I ■ A 


AAC AAA 
* 8 


008 ATC 

9 Z 


ATC 

: 


XAC 
T 


ATC 

I 


8AI TCT 
0 A 


ATC 

m 


TCT 
S 


XM. 
X 


CTC 
L 


ATC 

4 


TTC 
r 


TAC 

T 


AAC 
8 


3270 
312 


J371 CTC AK AAA 
31.1 T. T I 


CCC TAC 
» T 


CTC ATC 
L I 


CCC CAC 
3 0 


AM 

n 


AfC TAC 
8 T 


AAA 
8 


CTC 

l 


t 


AAC 
8 


ACT 
t 


A TV 

X 


XW 
A 


TUC 
% 





an cm mm m iu *»*|?> 
»> t t r i t « 



I-Scel coding sequence of pSCMS25 - Note the two amino acid 
N- terminal extension as compared to genuine version of the ge 



FIGURE 5 



variations flgniiMn ru e , sm £h££ 







•2 


•1 
H 


1 


X 


M 




5 

£ 


L, 


H 1 


—1 


10 

— a 














K 


• 


I 

M 


20 
X 


£ 


r 


< 


s 


a 


L 


I C 


i 


30 
H 


s 

I 


i 
t 


• 

■# 


P 
• 


s 
t 


i 
X 


S 


1 


i 


40 
k 


i 


k 




2 


A 
• 


I 


1 * 


S 




3 


c 


4 


< 


• 


1 


C 


n 


Q 


on 

r 




w 


< 


N 


K 


A 


Y f| 


0 


70 
H 


V 


C 


L 


- 


* 


I 


0 




V 


AO 

k 


3 


p 


P 


H 


K 


K 


E R 


V 


90 

N 


a 


I 


a 


N 


- 


1 


1 


T 


V 


100 
6 




0 


7 


? 


< 


H 


0 A 


' 


no 

N 


< 


L 


A 


H 


- 


: 


I 


v 


N 


120 
N 


< 


< 


I 


I 


P 


N 


N (. 




130 

e 


N 


f 


i 


t 




m 


3 




A 


140 

T 




r 


n 


ft 


A 


i 


* « 




ISO 
0 


Y 


N 


< 


N 


s 


» 


N 




s 


160 

1 




L 


H 


T 


Q 


s 


P T 




170 

z 


E 


V 




y 




V 


K 




L 


180 
R 




t 


p 


0 


L 


N 


C Y 




190 

K 


I 


M 


< 


H 




5 


I 


! 


Y 


200 

1 




S 


n 


s 


Y 


L 


1 P 




210 
N 


L 


I 


K 


9 


Y 




I 


P 


Q 


220 

n 




y 




I 


P 


N 


T I 




230 
9 


e 


T 


f 


I 


< 





EarftL 



poaitian 



thin em t» jhtHna withimt iftrtw mim fHim«rtfM^ 

•1 aad 4 Mtat natural. TK» two adda added Am to 

1 to 10: can ba daiatad 
detfiutokratad 
AOrlin-Zari tdtraiad 
4Lfiargvaa»kntcd 
43: A> mlii nil 
4* Km tainted 
91:i> takratad 

US aod 1H: L«ra teloratad 

ttiudflm taltratad 



»:lktflor2i 
40: L» ft 

• 4fc&toft 

to* ft or ft 

I M * M «4d:Xton 
P^paoa47:ltoa«r^ 

paMonftO: Ltcfi 
portion 144:11 to J5 
Ponton 143;Q»£ 
pootiae 14(342 (a £ 



FIGURE 6 

Gro«p I Intron Encoded Endoauclease. and Related Eadonucleaso 
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(Destruction : pGP 704 from De Lorenzo, with transposase 
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